Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 5271 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 715.6 KiB |
| Average record size in memory | 139.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 9 |
Ascites is highly overall correlated with Edema_Y | High correlation |
Bilirubin is highly overall correlated with Copper | High correlation |
Copper is highly overall correlated with Bilirubin | High correlation |
Edema_N is highly overall correlated with Edema_S and 1 other fields | High correlation |
Edema_S is highly overall correlated with Edema_N | High correlation |
Edema_Y is highly overall correlated with Ascites and 1 other fields | High correlation |
Hepatomegaly is highly overall correlated with Stage | High correlation |
Stage is highly overall correlated with Hepatomegaly | High correlation |
is_male is highly imbalanced (61.7%) | Imbalance |
Ascites is highly imbalanced (73.0%) | Imbalance |
Edema_N is highly imbalanced (55.7%) | Imbalance |
Edema_S is highly imbalanced (71.5%) | Imbalance |
Edema_Y is highly imbalanced (74.7%) | Imbalance |
Reproduction
| Analysis started | 2024-01-03 07:56:24.553557 |
|---|---|
| Analysis finished | 2024-01-03 07:56:35.653894 |
| Duration | 11.1 seconds |
| Software version | ydata-profiling vv4.6.3 |
| Download configuration | config.json |
N_years
Real number (ℝ)
| Distinct | 409 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.5854703 |
| Minimum | 0.11232877 |
|---|---|
| Maximum | 13.136986 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 0.11232877 |
|---|---|
| 5-th percentile | 0.91506849 |
| Q1 | 3.3808219 |
| median | 5.1561644 |
| Q3 | 7.3753425 |
| 95-th percentile | 11.463014 |
| Maximum | 13.136986 |
| Range | 13.024658 |
| Interquartile range (IQR) | 3.9945205 |
Descriptive statistics
| Standard deviation | 2.9776534 |
|---|---|
| Coefficient of variation (CV) | 0.53310702 |
| Kurtosis | -0.46751299 |
| Mean | 5.5854703 |
| Median Absolute Deviation (MAD) | 1.9013699 |
| Skewness | 0.42978853 |
| Sum | 29441.014 |
| Variance | 8.8664198 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.331506849 | 82 | 1.6% |
| 3.928767123 | 61 | 1.2% |
| 2.106849315 | 52 | 1.0% |
| 5.156164384 | 46 | 0.9% |
| 9.438356164 | 42 | 0.8% |
| 4.890410959 | 41 | 0.8% |
| 0.9150684932 | 39 | 0.7% |
| 6.284931507 | 39 | 0.7% |
| 6.093150685 | 37 | 0.7% |
| 5.967123288 | 36 | 0.7% |
| Other values (399) | 4796 |
| Value | Count | Frequency (%) |
| 0.1123287671 | 17 | |
| 0.1178082192 | 2 | < 0.1% |
| 0.1397260274 | 14 | |
| 0.1863013699 | 1 | < 0.1% |
| 0.1945205479 | 8 | |
| 0.2109589041 | 10 | |
| 0.301369863 | 13 | |
| 0.3561643836 | 4 | 0.1% |
| 0.3589041096 | 11 | |
| 0.3753424658 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 13.1369863 | 7 | 0.1% |
| 12.48219178 | 26 | |
| 12.39178082 | 9 | 0.2% |
| 12.35342466 | 15 | |
| 12.32876712 | 26 | |
| 12.23835616 | 12 | |
| 12.21643836 | 24 | |
| 12.2 | 20 | |
| 12.12876712 | 8 | 0.2% |
| 11.95890411 | 28 |
Age
Real number (ℝ)
| Distinct | 363 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.678786 |
| Minimum | 26.29589 |
|---|---|
| Maximum | 78.493151 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 26.29589 |
|---|---|
| 5-th percentile | 33.717808 |
| Q1 | 43.09589 |
| median | 51.523288 |
| Q3 | 56.668493 |
| 95-th percentile | 67.046575 |
| Maximum | 78.493151 |
| Range | 52.19726 |
| Interquartile range (IQR) | 13.572603 |
Descriptive statistics
| Standard deviation | 9.8189008 |
|---|---|
| Coefficient of variation (CV) | 0.19374775 |
| Kurtosis | -0.41688082 |
| Mean | 50.678786 |
| Median Absolute Deviation (MAD) | 6.8657534 |
| Skewness | -0.019642748 |
| Sum | 267127.88 |
| Variance | 96.410813 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 52.21917808 | 47 | 0.9% |
| 56.43013699 | 47 | 0.9% |
| 61.28493151 | 42 | 0.8% |
| 61.3369863 | 40 | 0.8% |
| 55.49041096 | 39 | 0.7% |
| 40.74520548 | 38 | 0.7% |
| 52.03561644 | 38 | 0.7% |
| 61 | 37 | 0.7% |
| 44.6 | 37 | 0.7% |
| 62.90410959 | 37 | 0.7% |
| Other values (353) | 4869 |
| Value | Count | Frequency (%) |
| 26.29589041 | 10 | |
| 28.90410959 | 13 | |
| 29.57534247 | 4 | 0.1% |
| 29.69589041 | 1 | < 0.1% |
| 30.29589041 | 7 | |
| 30.59452055 | 15 | |
| 30.88493151 | 16 | |
| 31.22739726 | 1 | < 0.1% |
| 31.40273973 | 16 | |
| 31.46575342 | 11 |
| Value | Count | Frequency (%) |
| 78.49315068 | 19 | |
| 76.76164384 | 5 | 0.1% |
| 75.0630137 | 19 | |
| 75.05205479 | 1 | < 0.1% |
| 74.57534247 | 14 | |
| 72.82191781 | 6 | 0.1% |
| 71.94246575 | 4 | 0.1% |
| 71.40821918 | 1 | < 0.1% |
| 70.95616438 | 14 | |
| 70.88493151 | 4 | 0.1% |
is_male
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 309.0 KiB |
| 0.0 | |
|---|---|
| 1.0 | 394 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 15813 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 4877 | |
| 1.0 | 394 | 7.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 4877 | |
| 1.0 | 394 | 7.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10148 | |
| . | 5271 | |
| 1 | 394 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10542 | |
| Other Punctuation | 5271 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10148 | |
| 1 | 394 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5271 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15813 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 10148 | |
| . | 5271 | |
| 1 | 394 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15813 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 10148 | |
| . | 5271 | |
| 1 | 394 | 2.5% |
Ascites
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 298.7 KiB |
| 0 | |
|---|---|
| 1 | 244 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5271 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5027 | |
| 1 | 244 | 4.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 5027 | |
| 1 | 244 | 4.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5027 | |
| 1 | 244 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5271 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5027 | |
| 1 | 244 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5271 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5027 | |
| 1 | 244 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5271 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5027 | |
| 1 | 244 | 4.6% |
Hepatomegaly
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 298.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5271 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 2730 | |
| 0 | 2541 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 2730 | |
| 0 | 2541 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2730 | |
| 0 | 2541 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5271 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2730 | |
| 0 | 2541 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5271 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2730 | |
| 0 | 2541 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5271 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2730 | |
| 0 | 2541 |
Spiders
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 298.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5271 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3972 | |
| 1 | 1299 | 24.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3972 | |
| 1 | 1299 | 24.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3972 | |
| 1 | 1299 | 24.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5271 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3972 | |
| 1 | 1299 | 24.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5271 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3972 | |
| 1 | 1299 | 24.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5271 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3972 | |
| 1 | 1299 | 24.6% |
Bilirubin
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 108 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.6003889 |
| Minimum | 0.3 |
|---|---|
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 0.7 |
| median | 1.1 |
| Q3 | 3 |
| 95-th percentile | 9.95 |
| Maximum | 28 |
| Range | 27.7 |
| Interquartile range (IQR) | 2.3 |
Descriptive statistics
| Standard deviation | 3.8523952 |
|---|---|
| Coefficient of variation (CV) | 1.4814689 |
| Kurtosis | 13.729631 |
| Mean | 2.6003889 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 3.424573 |
| Sum | 13706.65 |
| Variance | 14.840949 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6 | 609 | 11.6% |
| 0.9 | 422 | 8.0% |
| 0.8 | 405 | 7.7% |
| 0.7 | 398 | 7.6% |
| 0.5 | 388 | 7.4% |
| 1.1 | 294 | 5.6% |
| 1.3 | 231 | 4.4% |
| 1 | 174 | 3.3% |
| 0.4 | 123 | 2.3% |
| 3.2 | 111 | 2.1% |
| Other values (98) | 2116 |
| Value | Count | Frequency (%) |
| 0.3 | 30 | 0.6% |
| 0.4 | 123 | 2.3% |
| 0.5 | 388 | |
| 0.6 | 609 | |
| 0.7 | 398 | |
| 0.8 | 405 | |
| 0.9 | 422 | |
| 1 | 174 | 3.3% |
| 1.1 | 294 | |
| 1.2 | 99 | 1.9% |
| Value | Count | Frequency (%) |
| 28 | 14 | |
| 25.5 | 10 | 0.2% |
| 24.5 | 9 | 0.2% |
| 22.5 | 6 | 0.1% |
| 21.6 | 12 | 0.2% |
| 20 | 9 | 0.2% |
| 18 | 1 | < 0.1% |
| 17.9 | 17 | |
| 17.4 | 33 | |
| 17.2 | 4 | 0.1% |
Cholesterol
Real number (ℝ)
| Distinct | 222 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 352.48644 |
| Minimum | 120 |
|---|---|
| Maximum | 1775 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 120 |
|---|---|
| 5-th percentile | 201 |
| Q1 | 248 |
| median | 299 |
| Q3 | 390 |
| 95-th percentile | 652 |
| Maximum | 1775 |
| Range | 1655 |
| Interquartile range (IQR) | 142 |
Descriptive statistics
| Standard deviation | 200.43899 |
|---|---|
| Coefficient of variation (CV) | 0.56864313 |
| Kurtosis | 17.695544 |
| Mean | 352.48644 |
| Median Absolute Deviation (MAD) | 63 |
| Skewness | 3.6686841 |
| Sum | 1857956 |
| Variance | 40175.788 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 248 | 91 | 1.7% |
| 232 | 88 | 1.7% |
| 316 | 84 | 1.6% |
| 263 | 83 | 1.6% |
| 448 | 80 | 1.5% |
| 374 | 79 | 1.5% |
| 260 | 79 | 1.5% |
| 298 | 75 | 1.4% |
| 280 | 74 | 1.4% |
| 273 | 69 | 1.3% |
| Other values (212) | 4469 |
| Value | Count | Frequency (%) |
| 120 | 6 | 0.1% |
| 127 | 14 | 0.3% |
| 132 | 21 | |
| 149 | 3 | 0.1% |
| 151 | 10 | 0.2% |
| 168 | 8 | 0.2% |
| 172 | 6 | 0.1% |
| 174 | 10 | 0.2% |
| 175 | 39 | |
| 176 | 8 | 0.2% |
| Value | Count | Frequency (%) |
| 1775 | 12 | |
| 1712 | 11 | |
| 1600 | 12 | |
| 1480 | 9 | |
| 1336 | 12 | |
| 1280 | 1 | < 0.1% |
| 1276 | 12 | |
| 1236 | 1 | < 0.1% |
| 1232 | 1 | < 0.1% |
| 1128 | 13 |
Albumin
Real number (ℝ)
| Distinct | 154 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5380706 |
| Minimum | 1.96 |
|---|---|
| Maximum | 4.64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 1.96 |
|---|---|
| 5-th percentile | 2.94 |
| Q1 | 3.35 |
| median | 3.57 |
| Q3 | 3.77 |
| 95-th percentile | 4.09 |
| Maximum | 4.64 |
| Range | 2.68 |
| Interquartile range (IQR) | 0.42 |
Descriptive statistics
| Standard deviation | 0.35488579 |
|---|---|
| Coefficient of variation (CV) | 0.10030489 |
| Kurtosis | 1.1571997 |
| Mean | 3.5380706 |
| Median Absolute Deviation (MAD) | 0.22 |
| Skewness | -0.57891344 |
| Sum | 18649.17 |
| Variance | 0.12594392 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.35 | 245 | 4.6% |
| 3.6 | 234 | 4.4% |
| 3.7 | 211 | 4.0% |
| 3.85 | 150 | 2.8% |
| 3.77 | 144 | 2.7% |
| 3.26 | 144 | 2.7% |
| 3.5 | 132 | 2.5% |
| 3.65 | 129 | 2.4% |
| 3.2 | 115 | 2.2% |
| 3.57 | 112 | 2.1% |
| Other values (144) | 3655 |
| Value | Count | Frequency (%) |
| 1.96 | 3 | 0.1% |
| 1.97 | 1 | < 0.1% |
| 2.1 | 5 | 0.1% |
| 2.23 | 3 | 0.1% |
| 2.27 | 2 | < 0.1% |
| 2.31 | 5 | 0.1% |
| 2.33 | 9 | 0.2% |
| 2.35 | 1 | < 0.1% |
| 2.43 | 29 | |
| 2.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4.64 | 6 | 0.1% |
| 4.52 | 5 | 0.1% |
| 4.43 | 1 | < 0.1% |
| 4.4 | 8 | 0.2% |
| 4.38 | 10 | 0.2% |
| 4.3 | 37 | |
| 4.27 | 1 | < 0.1% |
| 4.24 | 9 | 0.2% |
| 4.23 | 14 | 0.3% |
| 4.22 | 10 | 0.2% |
Copper
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 164 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84.701679 |
| Minimum | 4 |
|---|---|
| Maximum | 588 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 39 |
| median | 65 |
| Q3 | 102 |
| 95-th percentile | 231 |
| Maximum | 588 |
| Range | 584 |
| Interquartile range (IQR) | 63 |
Descriptive statistics
| Standard deviation | 77.542064 |
|---|---|
| Coefficient of variation (CV) | 0.91547258 |
| Kurtosis | 11.653368 |
| Mean | 84.701679 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 2.8617216 |
| Sum | 446462.55 |
| Variance | 6012.7717 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 52 | 228 | 4.3% |
| 67 | 227 | 4.3% |
| 75 | 142 | 2.7% |
| 39 | 134 | 2.5% |
| 20 | 130 | 2.5% |
| 58 | 126 | 2.4% |
| 38 | 111 | 2.1% |
| 13 | 111 | 2.1% |
| 41 | 105 | 2.0% |
| 44 | 104 | 2.0% |
| Other values (154) | 3853 |
| Value | Count | Frequency (%) |
| 4 | 6 | 0.1% |
| 9 | 27 | 0.5% |
| 10 | 14 | 0.3% |
| 11 | 44 | 0.8% |
| 12 | 21 | 0.4% |
| 12.7 | 1 | < 0.1% |
| 13 | 111 | |
| 14 | 32 | 0.6% |
| 15 | 9 | 0.2% |
| 16 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 588 | 19 | |
| 558 | 9 | |
| 464 | 20 | |
| 444 | 12 | |
| 412 | 3 | 0.1% |
| 380 | 21 | |
| 358 | 12 | |
| 308 | 3 | 0.1% |
| 290 | 15 | |
| 281 | 5 | 0.1% |
Alk_Phos
Real number (ℝ)
| Distinct | 362 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1811.2333 |
| Minimum | 289 |
|---|---|
| Maximum | 13862.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 289 |
|---|---|
| 5-th percentile | 614 |
| Q1 | 823 |
| median | 1142 |
| Q3 | 1838.5 |
| 95-th percentile | 6064.8 |
| Maximum | 13862.4 |
| Range | 13573.4 |
| Interquartile range (IQR) | 1015.5 |
Descriptive statistics
| Standard deviation | 1935.3515 |
|---|---|
| Coefficient of variation (CV) | 1.0685269 |
| Kurtosis | 11.703609 |
| Mean | 1811.2333 |
| Median Absolute Deviation (MAD) | 451 |
| Skewness | 3.2187069 |
| Sum | 9547010.8 |
| Variance | 3745585.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 663 | 86 | 1.6% |
| 794 | 77 | 1.5% |
| 645 | 62 | 1.2% |
| 1636 | 61 | 1.2% |
| 944 | 53 | 1.0% |
| 674 | 44 | 0.8% |
| 7277 | 43 | 0.8% |
| 1052 | 42 | 0.8% |
| 1345 | 40 | 0.8% |
| 1440 | 38 | 0.7% |
| Other values (352) | 4725 |
| Value | Count | Frequency (%) |
| 289 | 13 | |
| 310 | 17 | |
| 369 | 9 | |
| 377 | 5 | 0.1% |
| 414 | 9 | |
| 423 | 21 | |
| 453 | 12 | |
| 466 | 2 | < 0.1% |
| 516 | 6 | 0.1% |
| 554 | 21 |
| Value | Count | Frequency (%) |
| 13862.4 | 12 | |
| 12285 | 1 | < 0.1% |
| 12258.8 | 17 | |
| 11552 | 8 | |
| 11320.2 | 13 | |
| 11046.6 | 7 | 0.1% |
| 10396.8 | 18 | |
| 10165 | 11 | |
| 9933.2 | 3 | 0.1% |
| 9066.8 | 14 |
SGOT
Real number (ℝ)
| Distinct | 195 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 113.58753 |
| Minimum | 26.35 |
|---|---|
| Maximum | 457.25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 26.35 |
|---|---|
| 5-th percentile | 54.25 |
| Q1 | 75 |
| median | 106.95 |
| Q3 | 137.95 |
| 95-th percentile | 198.4 |
| Maximum | 457.25 |
| Range | 430.9 |
| Interquartile range (IQR) | 62.95 |
Descriptive statistics
| Standard deviation | 48.964789 |
|---|---|
| Coefficient of variation (CV) | 0.4310754 |
| Kurtosis | 6.8408319 |
| Mean | 113.58753 |
| Median Absolute Deviation (MAD) | 31 |
| Skewness | 1.6710329 |
| Sum | 598719.85 |
| Variance | 2397.5505 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 71.3 | 190 | 3.6% |
| 57.35 | 153 | 2.9% |
| 137.95 | 142 | 2.7% |
| 128.65 | 132 | 2.5% |
| 170.5 | 127 | 2.4% |
| 97.65 | 119 | 2.3% |
| 120.9 | 116 | 2.2% |
| 93 | 113 | 2.1% |
| 106.95 | 97 | 1.8% |
| 122.45 | 87 | 1.7% |
| Other values (185) | 3995 |
| Value | Count | Frequency (%) |
| 26.35 | 5 | 0.1% |
| 28.38 | 13 | 0.2% |
| 41.85 | 16 | 0.3% |
| 43.4 | 24 | |
| 45 | 12 | 0.2% |
| 46.5 | 6 | 0.1% |
| 49 | 1 | < 0.1% |
| 49.6 | 38 | |
| 50 | 1 | < 0.1% |
| 51.15 | 50 |
| Value | Count | Frequency (%) |
| 457.25 | 14 | |
| 338 | 5 | 0.1% |
| 328.6 | 7 | |
| 299.15 | 4 | 0.1% |
| 288 | 8 | |
| 283.05 | 1 | < 0.1% |
| 280.55 | 12 | |
| 272.8 | 7 | |
| 267 | 1 | < 0.1% |
| 261.86 | 1 | < 0.1% |
Tryglicerides
Real number (ℝ)
| Distinct | 155 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 115.28609 |
| Minimum | 33 |
|---|---|
| Maximum | 598 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 84 |
| median | 104 |
| Q3 | 138 |
| 95-th percentile | 210 |
| Maximum | 598 |
| Range | 565 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 52.60278 |
|---|---|
| Coefficient of variation (CV) | 0.45628036 |
| Kurtosis | 12.672583 |
| Mean | 115.28609 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 2.4450711 |
| Sum | 607673 |
| Variance | 2767.0525 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 168 | 3.2% |
| 85 | 155 | 2.9% |
| 118 | 141 | 2.7% |
| 68 | 137 | 2.6% |
| 91 | 134 | 2.5% |
| 56 | 126 | 2.4% |
| 101 | 122 | 2.3% |
| 55 | 118 | 2.2% |
| 108 | 112 | 2.1% |
| 104 | 108 | 2.0% |
| Other values (145) | 3950 |
| Value | Count | Frequency (%) |
| 33 | 14 | 0.3% |
| 44 | 26 | 0.5% |
| 46 | 8 | 0.2% |
| 49 | 10 | 0.2% |
| 50 | 23 | 0.4% |
| 52 | 25 | 0.5% |
| 53 | 14 | 0.3% |
| 55 | 118 | |
| 56 | 126 | |
| 57 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 598 | 6 | 0.1% |
| 432 | 15 | |
| 394 | 1 | < 0.1% |
| 382 | 6 | 0.1% |
| 328 | 1 | < 0.1% |
| 322 | 3 | 0.1% |
| 319 | 3 | 0.1% |
| 318 | 12 | |
| 309 | 9 | |
| 280 | 13 |
Platelets
Real number (ℝ)
| Distinct | 223 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 264.02371 |
| Minimum | 62 |
|---|---|
| Maximum | 563 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 62 |
|---|---|
| 5-th percentile | 128 |
| Q1 | 209 |
| median | 259 |
| Q3 | 317 |
| 95-th percentile | 430 |
| Maximum | 563 |
| Range | 501 |
| Interquartile range (IQR) | 108 |
Descriptive statistics
| Standard deviation | 87.584068 |
|---|---|
| Coefficient of variation (CV) | 0.33172803 |
| Kurtosis | 0.34752348 |
| Mean | 264.02371 |
| Median Absolute Deviation (MAD) | 53 |
| Skewness | 0.42541087 |
| Sum | 1391669 |
| Variance | 7670.9689 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 344 | 160 | 3.0% |
| 336 | 111 | 2.1% |
| 181 | 108 | 2.0% |
| 213 | 106 | 2.0% |
| 265 | 92 | 1.7% |
| 268 | 90 | 1.7% |
| 295 | 89 | 1.7% |
| 165 | 87 | 1.7% |
| 251 | 86 | 1.6% |
| 231 | 85 | 1.6% |
| Other values (213) | 4257 |
| Value | Count | Frequency (%) |
| 62 | 12 | |
| 70 | 12 | |
| 71 | 11 | |
| 75 | 1 | < 0.1% |
| 76 | 2 | < 0.1% |
| 79 | 7 | |
| 80 | 16 | |
| 81 | 9 | |
| 88 | 1 | < 0.1% |
| 92 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 563 | 26 | |
| 539 | 4 | 0.1% |
| 518 | 9 | 0.2% |
| 517 | 3 | 0.1% |
| 516 | 1 | < 0.1% |
| 514 | 9 | 0.2% |
| 493 | 6 | 0.1% |
| 487 | 8 | 0.2% |
| 474 | 7 | 0.1% |
| 471 | 16 |
Prothrombin
Real number (ℝ)
| Distinct | 47 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.632865 |
| Minimum | 9 |
|---|---|
| Maximum | 15.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 41.3 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 9.6 |
| Q1 | 10 |
| median | 10.6 |
| Q3 | 11 |
| 95-th percentile | 12.05 |
| Maximum | 15.2 |
| Range | 6.2 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.79271131 |
|---|---|
| Coefficient of variation (CV) | 0.074552939 |
| Kurtosis | 2.3426417 |
| Mean | 10.632865 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 1.1209872 |
| Sum | 56045.83 |
| Variance | 0.62839122 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10.6 | 739 | 14.0% |
| 11 | 549 | 10.4% |
| 10 | 408 | 7.7% |
| 9.9 | 344 | 6.5% |
| 9.8 | 290 | 5.5% |
| 10.1 | 234 | 4.4% |
| 10.9 | 218 | 4.1% |
| 10.2 | 192 | 3.6% |
| 11.5 | 188 | 3.6% |
| 10.3 | 185 | 3.5% |
| Other values (37) | 1924 |
| Value | Count | Frequency (%) |
| 9 | 11 | 0.2% |
| 9.1 | 6 | 0.1% |
| 9.2 | 9 | 0.2% |
| 9.3 | 2 | < 0.1% |
| 9.4 | 14 | 0.3% |
| 9.5 | 112 | 2.1% |
| 9.6 | 180 | |
| 9.7 | 144 | |
| 9.8 | 290 | |
| 9.9 | 344 |
| Value | Count | Frequency (%) |
| 15.2 | 8 | 0.2% |
| 14.1 | 3 | 0.1% |
| 13.6 | 9 | 0.2% |
| 13.3 | 2 | < 0.1% |
| 13.2 | 28 | |
| 13.1 | 1 | < 0.1% |
| 13 | 43 | |
| 12.9 | 22 | |
| 12.8 | 1 | < 0.1% |
| 12.7 | 17 | 0.3% |
Stage
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 309.0 KiB |
| 3.0 | |
|---|---|
| 4.0 | |
| 2.0 | |
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 15813 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 4.0 |
| 4th row | 2.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 3.0 | 2122 | |
| 4.0 | 1792 | |
| 2.0 | 1117 | |
| 1.0 | 240 | 4.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3.0 | 2122 | |
| 4.0 | 1792 | |
| 2.0 | 1117 | |
| 1.0 | 240 | 4.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 5271 | |
| 0 | 5271 | |
| 3 | 2122 | |
| 4 | 1792 | 11.3% |
| 2 | 1117 | 7.1% |
| 1 | 240 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10542 | |
| Other Punctuation | 5271 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5271 | |
| 3 | 2122 | |
| 4 | 1792 | 17.0% |
| 2 | 1117 | 10.6% |
| 1 | 240 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5271 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15813 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 5271 | |
| 0 | 5271 | |
| 3 | 2122 | |
| 4 | 1792 | 11.3% |
| 2 | 1117 | 7.1% |
| 1 | 240 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15813 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 5271 | |
| 0 | 5271 | |
| 3 | 2122 | |
| 4 | 1792 | 11.3% |
| 2 | 1117 | 7.1% |
| 1 | 240 | 1.5% |
took_drug
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 298.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5271 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2694 | |
| 1 | 2577 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2694 | |
| 1 | 2577 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2694 | |
| 1 | 2577 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5271 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2694 | |
| 1 | 2577 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5271 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2694 | |
| 1 | 2577 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5271 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2694 | |
| 1 | 2577 |
Edema_N
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 298.7 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5271 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 4786 | |
| 0 | 485 | 9.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 4786 | |
| 0 | 485 | 9.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4786 | |
| 0 | 485 | 9.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5271 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4786 | |
| 0 | 485 | 9.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5271 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 4786 | |
| 0 | 485 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5271 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4786 | |
| 0 | 485 | 9.2% |
Edema_S
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 298.7 KiB |
| 0 | |
|---|---|
| 1 | 262 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5271 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5009 | |
| 1 | 262 | 5.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 5009 | |
| 1 | 262 | 5.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5009 | |
| 1 | 262 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5271 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5009 | |
| 1 | 262 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5271 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5009 | |
| 1 | 262 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5271 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5009 | |
| 1 | 262 | 5.0% |
Edema_Y
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 298.7 KiB |
| 0 | |
|---|---|
| 1 | 223 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5271 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 5048 | |
| 1 | 223 | 4.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 5048 | |
| 1 | 223 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5048 | |
| 1 | 223 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5271 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5048 | |
| 1 | 223 | 4.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5271 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5048 | |
| 1 | 223 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5271 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5048 | |
| 1 | 223 | 4.2% |
| Age | Albumin | Alk_Phos | Ascites | Bilirubin | Cholesterol | Copper | Edema_N | Edema_S | Edema_Y | Hepatomegaly | N_years | Platelets | Prothrombin | SGOT | Spiders | Stage | Tryglicerides | is_male | took_drug | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | -0.075 | -0.057 | 0.175 | 0.075 | -0.086 | 0.047 | 0.202 | 0.133 | 0.201 | 0.122 | -0.128 | -0.123 | 0.142 | -0.032 | 0.095 | 0.093 | 0.003 | 0.162 | 0.120 |
| Albumin | -0.075 | 1.000 | -0.185 | 0.388 | -0.294 | -0.061 | -0.270 | 0.321 | 0.055 | 0.411 | 0.271 | 0.248 | 0.126 | -0.179 | -0.224 | 0.232 | 0.154 | -0.108 | 0.061 | 0.112 |
| Alk_Phos | -0.057 | -0.185 | 1.000 | 0.148 | 0.332 | 0.314 | 0.289 | 0.107 | 0.096 | 0.137 | 0.198 | -0.140 | 0.047 | 0.086 | 0.425 | 0.088 | 0.062 | 0.206 | 0.053 | 0.050 |
| Ascites | 0.175 | 0.388 | 0.148 | 1.000 | 0.257 | -0.109 | 0.212 | 0.494 | 0.062 | 0.638 | 0.186 | -0.230 | -0.176 | 0.265 | 0.107 | 0.195 | 0.192 | 0.108 | 0.000 | 0.000 |
| Bilirubin | 0.075 | -0.294 | 0.332 | 0.257 | 1.000 | 0.338 | 0.588 | 0.333 | 0.149 | 0.374 | 0.338 | -0.388 | -0.168 | 0.274 | 0.498 | 0.306 | 0.141 | 0.297 | 0.112 | 0.082 |
| Cholesterol | -0.086 | -0.061 | 0.314 | -0.109 | 0.338 | 1.000 | 0.263 | 0.061 | 0.018 | 0.065 | 0.130 | -0.115 | 0.127 | -0.049 | 0.355 | 0.096 | 0.019 | 0.342 | 0.050 | 0.078 |
| Copper | 0.047 | -0.270 | 0.289 | 0.212 | 0.588 | 0.263 | 1.000 | 0.287 | 0.125 | 0.310 | 0.321 | -0.339 | -0.117 | 0.226 | 0.439 | 0.279 | 0.142 | 0.322 | 0.163 | 0.075 |
| Edema_N | 0.202 | 0.321 | 0.107 | 0.494 | 0.333 | 0.061 | 0.287 | 1.000 | 0.717 | 0.659 | 0.208 | 0.226 | 0.173 | -0.288 | -0.103 | 0.245 | 0.210 | -0.087 | 0.000 | 0.000 |
| Edema_S | 0.133 | 0.055 | 0.096 | 0.062 | 0.149 | 0.018 | 0.125 | 0.717 | 1.000 | 0.044 | 0.116 | -0.093 | -0.065 | 0.146 | 0.041 | 0.133 | 0.100 | 0.037 | 0.018 | 0.000 |
| Edema_Y | 0.201 | 0.411 | 0.137 | 0.638 | 0.374 | 0.065 | 0.310 | 0.659 | 0.044 | 1.000 | 0.171 | -0.224 | -0.179 | 0.256 | 0.103 | 0.206 | 0.191 | 0.085 | 0.000 | 0.000 |
| Hepatomegaly | 0.122 | 0.271 | 0.198 | 0.186 | 0.338 | 0.130 | 0.321 | 0.208 | 0.116 | 0.171 | 1.000 | -0.246 | -0.185 | 0.263 | 0.217 | 0.320 | 0.508 | 0.172 | 0.039 | 0.059 |
| N_years | -0.128 | 0.248 | -0.140 | -0.230 | -0.388 | -0.115 | -0.339 | 0.226 | -0.093 | -0.224 | -0.246 | 1.000 | 0.128 | -0.138 | -0.244 | 0.267 | 0.168 | -0.181 | 0.058 | 0.068 |
| Platelets | -0.123 | 0.126 | 0.047 | -0.176 | -0.168 | 0.127 | -0.117 | 0.173 | -0.065 | -0.179 | -0.185 | 0.128 | 1.000 | -0.180 | -0.017 | 0.214 | 0.139 | 0.015 | 0.031 | 0.073 |
| Prothrombin | 0.142 | -0.179 | 0.086 | 0.265 | 0.274 | -0.049 | 0.226 | -0.288 | 0.146 | 0.256 | 0.263 | -0.138 | -0.180 | 1.000 | 0.136 | 0.314 | 0.203 | 0.007 | 0.059 | 0.090 |
| SGOT | -0.032 | -0.224 | 0.425 | 0.107 | 0.498 | 0.355 | 0.439 | -0.103 | 0.041 | 0.103 | 0.217 | -0.244 | -0.017 | 0.136 | 1.000 | 0.175 | 0.078 | 0.164 | 0.064 | 0.090 |
| Spiders | 0.095 | 0.232 | 0.088 | 0.195 | 0.306 | 0.096 | 0.279 | 0.245 | 0.133 | 0.206 | 0.320 | 0.267 | 0.214 | 0.314 | 0.175 | 1.000 | 0.306 | 0.056 | 0.030 | 0.012 |
| Stage | 0.093 | 0.154 | 0.062 | 0.192 | 0.141 | 0.019 | 0.142 | 0.210 | 0.100 | 0.191 | 0.508 | 0.168 | 0.139 | 0.203 | 0.078 | 0.306 | 1.000 | 0.089 | 0.018 | 0.000 |
| Tryglicerides | 0.003 | -0.108 | 0.206 | 0.108 | 0.297 | 0.342 | 0.322 | -0.087 | 0.037 | 0.085 | 0.172 | -0.181 | 0.015 | 0.007 | 0.164 | 0.056 | 0.089 | 1.000 | 0.091 | 0.072 |
| is_male | 0.162 | 0.061 | 0.053 | 0.000 | 0.112 | 0.050 | 0.163 | 0.000 | 0.018 | 0.000 | 0.039 | 0.058 | 0.031 | 0.059 | 0.064 | 0.030 | 0.018 | 0.091 | 1.000 | 0.039 |
| took_drug | 0.120 | 0.112 | 0.050 | 0.000 | 0.082 | 0.078 | 0.075 | 0.000 | 0.000 | 0.000 | 0.059 | 0.068 | 0.073 | 0.090 | 0.090 | 0.012 | 0.000 | 0.072 | 0.039 | 1.000 |
| N_years | Age | is_male | Ascites | Hepatomegaly | Spiders | Bilirubin | Cholesterol | Albumin | Copper | Alk_Phos | SGOT | Tryglicerides | Platelets | Prothrombin | Stage | took_drug | Edema_N | Edema_S | Edema_Y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10.517808 | 54.038356 | 0.0 | 0 | 1 | 0 | 1.2 | 546.0 | 3.37 | 65.0 | 1636.0 | 151.90 | 90.0 | 430.0 | 10.6 | 2.0 | 1 | 1 | 0 | 0 |
| 1 | 6.761644 | 41.027397 | 0.0 | 0 | 0 | 0 | 1.1 | 660.0 | 4.22 | 94.0 | 1257.0 | 151.90 | 155.0 | 227.0 | 10.0 | 2.0 | 1 | 1 | 0 | 0 |
| 2 | 0.139726 | 36.024658 | 0.0 | 0 | 1 | 0 | 2.0 | 151.0 | 2.96 | 46.0 | 961.0 | 69.75 | 101.0 | 213.0 | 13.0 | 4.0 | 0 | 0 | 0 | 1 |
| 3 | 6.383562 | 56.191781 | 0.0 | 0 | 0 | 0 | 0.6 | 293.0 | 3.85 | 40.0 | 554.0 | 125.55 | 56.0 | 270.0 | 10.6 | 2.0 | 1 | 1 | 0 | 0 |
| 4 | 4.424658 | 60.010959 | 0.0 | 0 | 1 | 0 | 1.4 | 277.0 | 2.97 | 121.0 | 1110.0 | 125.00 | 126.0 | 221.0 | 9.8 | 1.0 | 1 | 1 | 0 | 0 |
| 5 | 3.926027 | 56.191781 | 0.0 | 0 | 0 | 0 | 0.8 | 198.0 | 3.94 | 38.0 | 911.0 | 57.35 | 56.0 | 280.0 | 9.8 | 1.0 | 1 | 1 | 0 | 0 |
| 6 | 4.890411 | 52.219178 | 0.0 | 0 | 0 | 0 | 0.4 | 273.0 | 3.65 | 25.0 | 671.0 | 84.00 | 177.0 | 284.0 | 9.9 | 3.0 | 0 | 1 | 0 | 0 |
| 7 | 5.273973 | 54.778082 | 0.0 | 0 | 1 | 0 | 1.8 | 244.0 | 3.26 | 64.0 | 6121.8 | 60.63 | 92.0 | 183.0 | 10.3 | 4.0 | 1 | 0 | 1 | 0 |
| 8 | 0.112329 | 65.928767 | 0.0 | 1 | 1 | 0 | 17.9 | 178.0 | 2.10 | 220.0 | 705.0 | 338.00 | 229.0 | 62.0 | 12.9 | 4.0 | 1 | 1 | 0 | 0 |
| 9 | 4.835616 | 78.493151 | 1.0 | 0 | 1 | 0 | 6.4 | 243.0 | 3.35 | 380.0 | 983.0 | 158.10 | 154.0 | 97.0 | 11.2 | 2.0 | 1 | 0 | 1 | 0 |
| N_years | Age | is_male | Ascites | Hepatomegaly | Spiders | Bilirubin | Cholesterol | Albumin | Copper | Alk_Phos | SGOT | Tryglicerides | Platelets | Prothrombin | Stage | took_drug | Edema_N | Edema_S | Edema_Y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5261 | 2.950685 | 53.342466 | 1.0 | 0 | 1 | 0 | 2.2 | 572.0 | 3.85 | 94.0 | 2184.0 | 85.25 | 154.0 | 278.0 | 11.0 | 2.0 | 1 | 1 | 0 | 0 |
| 5262 | 4.268493 | 44.427397 | 0.0 | 0 | 0 | 0 | 0.7 | 217.0 | 3.46 | 52.0 | 1031.0 | 85.25 | 195.0 | 224.0 | 9.8 | 3.0 | 0 | 1 | 0 | 0 |
| 5263 | 6.389041 | 62.564384 | 0.0 | 0 | 0 | 0 | 0.6 | 235.0 | 3.70 | 23.0 | 834.0 | 75.95 | 56.0 | 165.0 | 10.6 | 2.0 | 0 | 1 | 0 | 0 |
| 5264 | 3.380822 | 62.457534 | 0.0 | 0 | 1 | 0 | 4.5 | 328.0 | 3.26 | 75.0 | 1877.0 | 93.00 | 134.0 | 234.0 | 11.1 | 4.0 | 0 | 1 | 0 | 0 |
| 5265 | 7.049315 | 42.772603 | 0.0 | 0 | 0 | 0 | 0.8 | 217.0 | 3.85 | 40.0 | 685.0 | 88.35 | 130.0 | 281.0 | 9.8 | 3.0 | 0 | 1 | 0 | 0 |
| 5266 | 7.863014 | 33.641096 | 0.0 | 0 | 0 | 0 | 1.3 | 302.0 | 3.43 | 75.0 | 1345.0 | 145.00 | 44.0 | 181.0 | 10.6 | 3.0 | 0 | 1 | 0 | 0 |
| 5267 | 4.849315 | 67.953425 | 0.0 | 0 | 0 | 0 | 0.5 | 219.0 | 4.09 | 121.0 | 663.0 | 79.05 | 94.0 | 311.0 | 9.7 | 3.0 | 0 | 1 | 0 | 0 |
| 5268 | 10.156164 | 46.547945 | 0.0 | 0 | 1 | 0 | 0.8 | 315.0 | 4.09 | 13.0 | 1637.0 | 170.50 | 70.0 | 426.0 | 10.9 | 3.0 | 1 | 1 | 0 | 0 |
| 5269 | 3.331507 | 32.254795 | 0.0 | 0 | 0 | 0 | 0.7 | 329.0 | 3.80 | 52.0 | 678.0 | 57.00 | 126.0 | 306.0 | 10.2 | 1.0 | 0 | 1 | 0 | 0 |
| 5270 | 6.224658 | 59.178082 | 0.0 | 0 | 0 | 0 | 2.0 | 232.0 | 3.42 | 18.0 | 1636.0 | 170.50 | 83.0 | 213.0 | 13.6 | 2.0 | 1 | 1 | 0 | 0 |